Back

Mobile DNA

Springer Science and Business Media LLC

Preprints posted in the last 30 days, ranked by how well they match Mobile DNA's content profile, based on 27 papers previously published here. The average preprint has a 0.01% match score for this journal, so anything above that is already an above-average fit.

1
Lifestyles of Gypsy-family transposons shape their regulatory mechanisms

Papameletiou, A.-M.; Czech Nicholson, B.; Bornelöv, S.; Hannon, G. J.

2026-05-21 genomics 10.64898/2026.05.19.726053 medRxiv
Top 0.1%
4.3%
Show abstract

Transposable elements are a highly diverse group of selfish genomic elements, prevalent across the tree of life, whose uncontrolled propagation poses a threat to genome stability. Recent studies have explored the evolution of Drosophila melanogaster transposable elements, their co-evolution with the host genome, and mechanisms that regulate their activity. However, little is known about their cross-species evolutionary patterns. Long terminal repeat (LTR) retrotransposons are the most active group of transposable elements in Drosophila. They are broadly separated into retroelements, which are active in the germline, and insect endogenous retroviruses that are active in the soma. Somatic elements are hypothesised to infect the germline through their acquisition of virus-derived proteins such as Envelope and sORF2, thus multiplying through successive generations. In this study, we curated the sequences of LTR retrotransposons in 249 drosophilid genomes, allowing us to study their evolution across these species and highlight their varying degrees of conservation. Furthermore, we reveal multiple instances of Envelope protein loss or inactivation that suggest shifts in the expression pattern of these transposons, likely accompanied by adopting different transcriptional control mechanisms. We contrast this with the evolutionary history of sORF2, which we found to be much more stable. Lastly, we examined variations in transposon LTR regions responsible for transcriptional regulation and use predictive modelling to suggest six transcription factors likely involved in their tissue-specific expression. Altogether, we reveal complex, interspecies evolutionary patterns of Gypsy-family LTR retrotransposons and highlight examples of their co-evolution with their host genome.

2
Cp36 serine recombinase as a new tool for zebrafish transgenesis

Thrikawala, S.; Naples, B.; Rosowski, E.

2026-05-11 genetics 10.64898/2026.05.06.723361 medRxiv
Top 0.1%
1.7%
Show abstract

One feature key to the versatility of zebrafish as an animal model for biomedical research is the breadth of genetic tools available, including for transgenesis. While the Tol2 transposase system remains the gold standard, its efficiency can be highly variable. Here, we explored the potential of a complementary transgenesis system, Cp36, a large serine recombinase identified from Clostridium perfringens previously found to efficiently integrate target cargo into the human genome without a preinstalled attB site. We generated Cp36-based plasmid constructs for zebrafish transgenesis and compared their performance to matched Tol2 plasmids across multiple experimental contexts, including transient expression, germline transmission, and multi-transgene expression. Cp36 integrates small [~]3.5kb cargo into the zebrafish genome and transmits to the next generation as efficiently as Tol2, but Cp36 performance declines substantially for larger [~]7.5kb constructs. Both Cp36 and Tol2 have comparable efficiency in transiently expressing a second construct regardless of the transposase/recombinase used to integrate the first construct, indicating compatibility with sequential transgenesis strategies. In summary, we demonstrate that Cp36 functions as a new alternative transgenesis method in zebrafish.

3
Distribution of prophage-encoded Pas sRNAs across pathogenic Escherichia coli

Zhu, D. X.; Shabalina, S. A.; Storz, G.

2026-05-14 evolutionary biology 10.64898/2026.05.13.724894 medRxiv
Top 0.1%
1.6%
Show abstract

Numerous base pairing small RNAs (sRNAs), which are an integral part of regulatory networks in bacteria, are encoded on mobile genetic elements (MGEs) in pathogenic strains of Escherichia coli. These sRNAs help coordinate the expression of MGE-encoded virulence factors with core genome-encoded cellular pathways. To investigate the evolution of MGE-encoded sRNAs, we queried public databases to characterize the distribution of PasA, PasB, PasC, PasD1, and PasD2, five prophage-encoded sRNAs discovered in enteropathogenic E. coli. We find that while the Pas sRNAs are largely restricted to pathogenic lineages of Escherichia and Shigella, they exhibit diversity in sequence, genomic presence, and copy number across strains. Based on phylogenetic analysis, the Pas sRNAs originate from multiple ancestral lineages and associate with specific E. coli pathovars, consistent with horizontal acquisition followed by retention. Syntenic analysis suggests a phage origin for the Pas sRNAs, likely from Shiga-toxin encoding phages, but the sRNAs appear to have diverged substantially following their integration into bacterial chromosomes. Comparative and structural analyses further suggest that the PasA and PasC sRNAs share a common ancestor as is the case for PasD and STnc100, another prophage-encoded sRNA. These findings add to our understanding of how accessory genome-encoded sRNAs emerge and evolve.

4
IGS38, a lncRNA from the human rDNA intergenic spacer, regulates rRNA transcription by altering rDNA chromatin organisation and activating the transcription machinery

Tariq, K.; Polenkowski, M.; Quin, J.; Sugathan, A.; Isacson, S.; Jakobsson, S.; Enervald, E.; von Euler, A.; Öst, A.; Visa, N.; Östlund Farrants, A.-K.

2026-05-04 cell biology 10.64898/2026.05.02.722362 medRxiv
Top 0.1%
1.6%
Show abstract

The eukaryotic ribosomal genes are multi-copy genes, transcribed from the rDNA, and approximately one third of them is actively transcribed in differentiated cells. A number of lncRNAs have been identified from the intergenic spacer between the rRNA genes, among those the spacer RNA and PAPAS that are involved silencing of rRNA gene copies by altering the chromatin configuration. Here, we have identified lncRNAs that are transcribed from the human rDNA loci and modulate the loci; IGS38 positively regulates rRNA gene transcription by associating to the 47S rRNA gene promoter and modulating the rRNA promoter accessibility while IGS32as associates with heterochromatin. IGS38 binds to the 47S gene promoter through the RNA pol I factors TAF1C and RRN3 as well as the Williams Syndrome Transcription Factor (WSTF), a component of the B-WICH chromatin remodelling complex. The increased accessibility of the promoter stabilises the architectural protein Upstream Binding Factor (UBF) at the rRNA promoter, thereby facilitating RNA pol I promoter escape. Furthermore, IGS38 knock down displays and increased dsRNA abundance in the cytoplasm with a weak induction of the dsRNA sensor OAS2, typically induced by interferon and viral dsRNA. Overall, the both IGS38 and IGS32as are chromatin associated lncRNAs involved in rDNA chromatin changes, and IGS38 is stimulating, together with WSTF, rRNA gene transcription in human cells. Graphical abstract O_FIG O_LINKSMALLFIG WIDTH=199 HEIGHT=200 SRC="FIGDIR/small/722362v1_ufig1.gif" ALT="Figure 1"> View larger version (29K): org.highwire.dtl.DTLVardef@14d4159org.highwire.dtl.DTLVardef@fd773forg.highwire.dtl.DTLVardef@a0030dorg.highwire.dtl.DTLVardef@1285301_HPS_FORMAT_FIGEXP M_FIG C_FIG IGS stabilises 47S rRNA transcription, disruption of IGS38 expression leads to the release of dsRNA in the cytoplasm and a weak immune activation of OAS2. Created by biorender (https://biorender.com/shortURL)

5
Isotope-Free Mapping of protein:RNA Interactions Using fCRAC and trxtools

Robertson, N.; Mikolajczyk, J.; Garcia-Sandoval, A. C.; Helwak, A.; Major, M. L.; Emadali, A.; Tollervey, D.; Turowski, T. W.

2026-05-21 molecular biology 10.64898/2026.05.19.726220 medRxiv
Top 0.1%
1.3%
Show abstract

Defining high-confidence RNA interaction sites for specific proteins is essential to understand RNA biology, but existing methods face trade-offs between specificity, sensitivity, and experimental accessibility. Here, we present fluorescent Cross-linking and analysis of cDNAs (fCRAC), a mammalian-cell optimized update to the CRAC protocol. In fCRAC, a fluorescent adaptor is used, in place of radiolabeling, to visualize RNA-protein complexes during gel-purification. fCRAC retains the tandem affinity purification and stringent, denaturing conditions of classical CRAC, enabling nucleotide-resolution mapping of protein:RNA interactions with high signal to noise ratio. We initially tested fCRAC using RPP25L, a component the RNase MRP and RNase P complexes. RPP25L almost exclusively bound to predicted, single sites in the RNA components (RMRP and RPPH1), showing excellent selectivity with nucleotide resolution. To support analysis of UV cross-linking data for more complex targets, we developed the trxtools package and example pipeline for standardized processing, quality control, and analysis of data from fCRAC and related methods. We include tailored strategies for repetitive RNA classes, such as tRNA and rRNA, which can be challenging to analyze using other approaches. We applied fCRAC and trxtools to define the RNA interactome of human CYCLON/CCDC86, a nuclear protein previously implicated in oncogenesis. This revealed specific interactions with rRNA, tRNA and ncRNAs involved in pre-rRNA and pre-tRNA processing. HighlightsO_LINucleotide-resolution definition of RNAs interacting with specific proteins, including rRNA and tRNA C_LIO_LIStringent denaturing purifications and robust visualization steps, with no requirement for radioactive labelling C_LIO_LITrxtools provides an integrated analysis pipeline with approaches for analyzing both single and multi-copy RNA species C_LI

6
Evidence for independent retroviral syncytin-like Env endogenization in non-placental chondrichthyans

Proudley, E.; Reddin, I. G.; Cleal, J. K.; Lewis, R. M.; Laundon, D.

2026-05-07 evolutionary biology 10.64898/2026.05.06.723177 medRxiv
Top 0.3%
0.5%
Show abstract

Viviparity and placentation are remarkable examples of convergent evolution across vertebrates. The evolution of the uniquely intimate mammalian placenta has been associated with the repeated independent capture of fusogenic retroviral Env proteins, called syncytins. Research into syncytin capture has therefore been predominantly focused on resolving their central role in mammalian placentation. As such, the presence of syncytin-like Env proteins outside of mammals, and their role in non-placental physiological contexts, remain much less understood. We expanded this understanding by systematically surveying genomes from 36 chondrichthyan species (sharks, rays, skates, and chimaeras), which display a wide range of independently evolved placental and non-placental reproductive strategies, for the presence of syncytin-like Env genes. We identified 295 candidate syncytin-like Env proteins from 16 chondrichthyan species, with a subset displaying conserved fusogenic domains, structural homology with known syncytins, and genomic signatures of endogenization. Using transcriptomic data from the model catshark Scyliorhinus canicula, we found that syncytin-like Env genes are transcriptionally active in diverse adult tissue types. Using two closely related species of Squalus (spiny dogfish), we present evidence that endogenized Env genes are syntenically conserved, indicative of vertical transmission from a common ancestor before species divergence. Notably, we detected no candidates in any placental shark genome, suggesting that syncytin-like Env capture is not a feature of shark placentation. Our findings expand the known phylogenetic breadth and functional scope of syncytin-like Env protein endogenization beyond mammalian placentation, providing a solid foundation for future investigations into the wider role of retroviral capture in vertebrate biology and evolution.

7
The gene-like promoter and transcription of LTR retrotransposons.

Gooden, C.; Li, X.; Walter, I.; Ou, S.

2026-05-12 plant biology 10.64898/2026.05.08.723830 medRxiv
Top 0.3%
0.5%
Show abstract

Transcriptional regulation is one of the fundamental approaches for young plants to cope with environmental fluctuations and maintain active development. The transposable element (TE) subclass long terminal-repeat retrotransposons (LTR-RTs) can act as additional regulators for genes through enhancer and promoter activity, but their promoters, transcription initiation, and contributions during maize development remain uncharacterized. Here, we developed IsoClassifier to resolve the transcription start site (TSS) and RNA isoforms of LTR-RTs based on long-read transcriptomics, delineating LTR U3 regions as the native promoter and enhancer of LTR-RTs. We reveal conserved motifs associated with core promoter activity in transcribed LTR-RTs that are highly comparable to gene promoters. Further, we found that LTR-RT transcription in maize was dominated by spliced, long non-coding RNA. Finally, a genome-wide coexpression analysis revealed that LTR-RTs are transcribed as hub-like elements in coexpression networks, suggesting important roles in gene regulation. We conclude that LTR-RTs have similar promoter compositions to gene promoters and likely share similar transcription regulation programs.

8
Identification of rare pseudouridylated microRNA by comprehensive small RNA bisulfite sequencing of mouse and human tissues

Fagre, C.; Gilbert, W. V.

2026-05-15 molecular biology 10.64898/2026.05.14.725264 medRxiv
Top 0.3%
0.4%
Show abstract

Pseudouridine ({Psi}) is an important post-transcriptional modification of many noncoding RNAs that is under-characterized in microRNA (miRNA) due to historical limitations in pseudouridine mapping methods. {Psi} modification stabilizes RNA duplex structures and could therefore play an important role in miRNA target binding and repression. To investigate the extent to which mammalian miRNAs are modified with {Psi}, we profiled the modification landscape of short (<30 nt) RNA in human cells and mouse tissues using bisulfite sequencing. Our approach was powered to detect small RNA pseudouridylation based on robust detection of known {Psi} positions in tRNA fragments (tRFs), some of which show tissue-specific patterns of modification. In contrast with tRFs, we find that miRNA pseudouridylation is exceedingly rare, with a single modified miRNA (miR-3068-5p) identified in mouse tissues. Pseudouridylated miR-3068-5p diSerentially repressed predicted miRNA targets with less stable miRNA:mRNA pairing modes. This study fills a long-standing gap in transcriptome-wide {Psi} profiling and reveals a new potential function for {Psi} as a modulator of activity of small regulatory RNAs.

9
In the Drosophila germline H2Av and Arp6 suppress transposons by driving piRNA pathway expression

Andrasi, N.; Ryon, H. M.; Luo, Y.; Fejes Toth, K.

2026-05-21 molecular biology 10.64898/2026.05.19.726329 medRxiv
Top 0.3%
0.4%
Show abstract

The spatiotemporal control of transcription and the maintenance of germline genome integrity depend on dynamic chromatin architecture. In Drosophila, the actin-related protein Arp6--a core subunit of the SWR1-like Domino chromatin remodeling complex--mediates the deposition of the histone variant H2Av. Previous studies have established H2Av as a key transcriptional regulator that modulates the +1 nucleosome barrier to promote RNA Polymerase II (Pol II) pause release and productive elongation. Conversely, H2Av is also integral to heterochromatin assembly and gene silencing. Here we demonstrate that Arp6 and H2Av are essential for female fertility and the global repression of transposable elements (TEs) in the Drosophila ovary. Rather than repressing TEs directly, we show that Arp6 and H2Av maintain genomic stability indirectly by driving the transcription of core PIWI-interacting RNA (piRNA) pathway genes. Depletion of either chromatin factor leads to a significant loss of piRNAs and reduced non-canonical transcription of dual-strand piRNA clusters. This defect stems from a failure to express the Rhino-Deadlock-Cutoff (RDC) complex, alongside the downregulation of multiple other piRNA biogenesis factors. Genomic profiling confirms that H2Av acts predominantly as an activating signal at host gene promoters. Upon H2Av or Arp6 depletion, genes that rely on H2Av for their expression exhibit a distinct upstream shift and more precise spatial localization of the Pol II peak at the TSS, indicating an impaired transition from transcription initiation into productive elongation. Together, our findings build upon the known transcriptional activation functions of the Arp6-H2Av axis, revealing that this established chromatin mechanism is critical for licensing piRNA-mediated genome defense and ensuring germline maintenance.

10
Toward a probabilistic definition of chromatin accessible regions at the single-cell level

Sanchez-Escabias, E.; Rico, D.; Reyes, J. C.

2026-05-04 genomics 10.64898/2026.05.01.722232 medRxiv
Top 0.4%
0.4%
Show abstract

Understanding cis-regulatory elements (CREs) at the single cell level is fundamental to deciphering transcriptional changes during development, cell differentiation, and homeostasis. Recent studies have shown that arbitrary peak-calling thresholds complicate data interpretation and cross-study comparisons. Furthermore, due to the inherent sparsity of single-nuclei ATAC-seq (snATAC-seq) data, distinguishing between truly inaccessible regions and technical dropouts remains challenging. Our analysis of snATAC-seq experiments performed in a well-established cell line suggests that the dichotomy between accessible (open) or inaccessible (close) CREs is misleading. Thousands of accessible regions are present in a very small fraction of cells of the population but they are repeatedly identified, suggesting that they have a low accessibility or are only transiently accessible. However, depending on the detection threshold selected they could be considered as either genuine CREs or noise. To resolve this inconsistency, we propose a model where chromatin accessibility is treated as a continuum, defined by a probability of accessibility (Pa) for each accessible region across cell types and conditions. Through computational simulations, we demonstrate that snATAC-seq results can be explained by a simple "balls into bins" probability model, offering a theoretical framework for calculating Pa distributions from any snATAC-seq dataset. Furthermore, we examine how Pa distributions shift following activation of the TGF{beta} signaling pathway. This probabilistic approach removes the reliance on arbitrary thresholds and supports a more quantitative, and dynamic understanding of accessible regions function.

11
Landscape of retron diversity across the SPIRE prokaryotic metagenome resource reveals candidate novel type XI-like lineages

Toro, N.

2026-05-14 microbiology 10.64898/2026.05.14.725207 medRxiv
Top 0.4%
0.4%
Show abstract

Retrons are prokaryotic genetic elements encoding a specialized reverse transcriptase (RT) that synthesizes multicopy single-stranded DNA (msDNA) and are increasingly recognized as components of bacterial anti-phage defense systems. However, their diversity and ecological distribution across large-scale genomic resources remain poorly characterized. Here, we surveyed retron RTs across the SPIRE representative metagenome collection, a non-redundant, species-level dataset spanning diverse microbial habitats. Using type-specific hidden Markov models cross-calibrated against all known retron groups, we identified retrons representing all canonical types together with additional divergent lineages. Retron distribution was highly structured across both taxonomy and ecology, with some lineages restricted to specific bacterial phyla and others broadly distributed across habitats. Host-associated environments accounted for the largest fraction of retron candidates, although several retron types showed clear specialization for aquatic or anthropogenic niches. Systematic novelty assessment identified two candidate type XI-like lineages, TXI_C2like and TXI_noncan_h, characterized by protease-independent architectures and lineage-specific accessory modules. De novo computational analyses further identified candidate msr/msd-like non-coding RNA elements upstream of the RT genes in both lineages, indicating that these highly divergent systems retain the defining RT-ncRNA architecture of canonical retrons. Together, these findings expand the known diversity of retron systems and reveal a strong interplay between evolutionary history and ecological specialization.

12
LVV SMRTcap reveals extensive proviral variation in lentiviral vector-transduced CAR T cells

Kaiser, C.; Sadri, G.; Elliott, E. M.; Mroczkowska, J. J.; Ankita, J.; Ferguson, M.; Bushman, F.; Fraietta, J. A.; Rouchka, E. C.; Smith, M.

2026-05-15 cancer biology 10.64898/2026.05.13.724601 medRxiv
Top 0.4%
0.3%
Show abstract

Lentiviral vectors are commonly used to introduce chimeric antigen receptor transgenes into T cells, but routine assays quantify vector copy number or integration sites without sequencing full-length integrated vectors. HIV-1 proviruses often acquire large deletions and cytidine deaminase-driven hypermutation; whether similar variation occurs in therapeutic lentiviral vectors is unclear. We adapted a novel long-read capture approach to enrich long fragments spanning vector DNA and adjacent human sequence, enabling simultaneous integration-site mapping and proviral integrity analysis with single-molecule resolution. In research-grade CAR T cells produced with an experimental, transient-transfection lentiviral vector workflow, 40% of integrated vectors carried recurrent deletions that removed the internal promoter or parts of the chimeric antigen receptor cassette. The dominant promoter deletion was present in the viral stock. In clinical chimeric antigen receptor T cell products, promoter deletions were less frequent, but detectable pre-infusion and post-infusion. Across datasets we observed widespread G-to-A substitutions consistent with restriction factor editing, including changes predicted to introduce premature stop codons within the transgene open reading frame. Our method reveals proviral variants invisible to standard quality-control assays and provides a framework to improve vector production and monitor transgene integrity in clinical products.

13
Epigenetic splicing regulation of hTERT mediated by hTAPAS

Patterson, V. S. J.; Limbach, A. G.; Dogruyol, J. A.; Kraemer, C.; Erichsen, L.

2026-05-12 molecular biology 10.64898/2026.05.08.723733 medRxiv
Top 0.5%
0.3%
Show abstract

Telomerase activity is primarily controlled by expression of the catalytic subunit hTERT, yet the mechanisms coordinating its transcriptional and post-transcriptional regulation remain incompletely understood. Paradoxically, hypermethylation of a CpG-rich region upstream of the hTERT transcription start site, termed the TERT Hypermethylated Oncological Region (THOR), is strongly associated with hTERT activation. Recent studies suggest that THOR methylation influences the expression of hTAPAS, a long non-coding RNA transcribed antisense to the hTERT promoter that represses hTERT expression. Here, we investigated the relationship between DNA methylation and alternative splicing of hTERT. Using CRISPR-Cas9-mediated targeted enrichment combined with Nanopore sequencing, we generated high-resolution DNA methylation maps across several kilobases surrounding the hTERT promoter and intronic regions encompassing exons 6-8 in multiple human cell lines. These analyses revealed distinct methylation signatures that correlate with specific hTERT splice isoform profiles. Functional perturbation experiments further demonstrated that altering the epigenetic state of the locus influences hTERT splicing. Overexpression of the antisense lncRNA hTAPAS, as well as treatment with the DNA demethylating agent 5'-Azacytidine, reduced CpG methylation at the hTERT promoter and induced a shift in hTERT splice isoform distribution. Together, our findings identify DNA methylation as an upstream regulator of hTERT alternative splicing and implicate the antisense lncRNA hTAPAS in shaping this regulatory landscape. The results point to an epigenetic mechanism linking hTAPAS to hTERT promoter methylation and splice isoform selection at the hTERT locus and provide new insight into how telomerase regulation may be remodeled during development and in cancer.

14
SnoRNA Expression and RNA 2'-O-Methylation in Drosophila melanogaster S2 Cells

Ye, X.; Liu, Y.; Olson, S.; Zhan, L.; Carmichael, G. G.; Graveley, B.

2026-05-22 genomics 10.64898/2026.05.21.726978 medRxiv
Top 0.6%
0.3%
Show abstract

Small nucleolar RNAs (snoRNAs) are a class of non-coding RNAs that play critical roles in guiding 2-O-methylation (Nm) and pseudouridylation modifications of RNAs. In Drosophila melanogaster, snoRNAs undergo dynamic changes in expression during development. In this study, we identified 239 snoRNAs that are robustly expressed in Drosophila S2 cells, representing 87% of all annotated Drosophila snoRNAs. Given that box C/D snoRNAs guide site-specific 2-O-methylation (Nm) of RNA, we next characterized the Nm landscape of S2 cells using RibOxi-seq2, a high-throughput approach capable of detecting Nm modifications with single-nucleotide resolution. RibOxi-seq2 revealed 17 Nm sites in 18S rRNA with a 94% concordance to previously reported RiboMeth-Seq data. In 28S rRNA, 30 Nm sites were identified, corresponding to an 71.4% overlap with established references. Additionally, we detected both a known Nm site (Gm74) and a novel site (Um66) in 5.8S rRNA, further validating the sensitivity and specificity of the approach. RibOxi-seq2 further identified Nm sites in small nuclear RNAs (snRNAs), expanding the annotation of modified non-coding RNAs. Additionally, the method revealed Nm modifications within internal regions of mRNAs. In total, we detected Nm modifications in 2,057 unique mRNAs, underscoring the widespread presence of this epitranscriptomic modification in coding transcripts. Strikingly, although we could not identify any snoRNAs predicted to guide the mRNA 2-O-methylation modifications by canonical mechanisms, we identified strong consensus sequences surrounding many of these mRNA sites. Together, our findings not only expand the known landscape of Nm-modified RNAs but also highlight the robustness of RibOxi-seq2 for transcriptome-wide RNA modification profiling. Collectively, this study presents a comprehensive atlas of snoRNA expression and 2-O-methylation sites in Drosophila S2 cells, offering valuable insights into the epitranscriptomic landscape orchestrated by snoRNAs.

15
RT-nested and interfering-Primer PCR reveal prevalent isoform-specific A-to-I RNA editing in neuronal genes

Wang, Z.; Ni, Y.; Cai, W.; Li, H.; Duan, Y.

2026-05-17 molecular biology 10.64898/2026.05.15.725286 medRxiv
Top 0.6%
0.2%
Show abstract

BackgroundMetazoan adenosine-to-inosine (A-to-I) mRNA editing temporospatially diversifies the neuronal transcriptome and proteome. The limited read length from next-generation sequencing (NGS) constrains the quantification of the potentially differential editing levels across different splicing isoforms, restricting our understanding of the extent to which RNA editing contributes to molecular diversity and its interplay with splicing. MethodsWe employed reverse transcription nested PCR (RT-nPCR) and developed a novel interfering-Primer PCR (iPrimer PCR) technique to distinguish different transcripts of any gene. We selected multiple essential genes exhibiting RNA editing in coding sequences (CDSs) or untranslated regions (UTRs) for isoform-specific amplification and Sanger sequencing. ResultsNine different Adar isoforms together with pre-mRNA had distinct editing levels at the S>G auto-recoding site, which was predicted to have isoform-specific effects on catalytic activities. Although pre-mRNA editing might exert isoform-dependent promotion/suppression of splicing, closely located editing sites, such as those in neuronal genes qvr and stj, still exhibited high correlation in editing levels due to co-editing. iPrimer strategy further discovered differential recoding levels between the long/short 3UTR isoforms of gene jef. ConclusionsWe provide the first comprehensive solution for isoform-specific PCR amplification of any gene, enabling quantification of RNA editing level of different isoforms. Our results offer insights into how RNA editing interplays with splicing, and highlight its complicated role in expanding molecular diversity. Graphical abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=79 SRC="FIGDIR/small/725286v1_ufig1.gif" ALT="Figure 1"> View larger version (17K): org.highwire.dtl.DTLVardef@1ebc82org.highwire.dtl.DTLVardef@1ea365dorg.highwire.dtl.DTLVardef@1971aceorg.highwire.dtl.DTLVardef@160d053_HPS_FORMAT_FIGEXP M_FIG C_FIG We developed isoform-specific PCR followed by Sanger sequencing, and achieved the quantification of differential RNA editing levels in different transcripts of a gene.

16
Establishment of titration-based control of DNA replication in Escherichia coli

Adiego-Perez, B.; Fluit, D.; Ludwig, C.; Berger, M.; Hohlbein, J.; Staals, R. H.; ten Wolde, P. R.; van der Oost, J.; Claassens, N. J.; Olivi, L.

2026-05-09 molecular biology 10.64898/2026.05.06.723188 medRxiv
Top 0.6%
0.2%
Show abstract

Escherichia coli couples the initiation of DNA replication with cell size by modulating the activity of the replication initiator protein DnaA. The activity of DnaA is regulated by both its interconversion between an active and inactive form and its titration on binding sites on the chromosome. Whereas its interconversion has been thoroughly studied, the extent to which DnaA titration can control replication initiation is poorly understood. Here, we describe the control of E. coli DNA replication via titration by modulating the expression of an always-active DnaA variant in four growth conditions. While we obtained stable cell cycles during slow growth, faster growth associated with overlapping replication forks led to replicative instability and DNA damage. Overall, our results provide insights into the limits of titration-based systems in the control of genome replication and their potential role in the evolutionary trajectory of E. coli. Finally, this study provides design principles for a simplified, titration-only regulatory mechanism for DNA replication in synthetic cells.

17
Mutational and bioinformatic analysis of the binding site for the ribonucleotide reductase-specific transcriptional repressor NrdR

Shahid, S.; Lundin, D.; Rozman Grinberg, I.; Sjöberg, B.-M.

2026-05-14 molecular biology 10.64898/2026.05.11.724285 medRxiv
Top 0.7%
0.2%
Show abstract

The prevalent transcriptional repressor NrdR binds to highly conserved prokaryotic sequences in the promoter regions of operons encoding the essential enzyme ribonucleotide reductase. The NrdR binding sites consist of two partially palindromic 16 bp sequences (NrdR boxes) separated by a 15-16 bp linker sequence. We have assessed the requirement of both boxes for binding, the propensity of different NrdRs to bind to heterologous binding sites, and that the linker sequence is only limited to length and not sequence conservation. As we have observed several deviations from the conserved sequences of the NrdR boxes, we here test the conservation requirements of individual basepairs in the NrdR boxes using a synthetic DNA fragment (Synt DNA) to which the NrdR proteins from the actinomycete Streptomyces coelicolor and the gammaproteobacterium Escherichia coli bind equally well as to their homologous binding sites. By introducing isolated mutations to Synt DNA and testing the binding capacity of NrdR from S. coelicolor and E. coli we expand our understanding of what criteria are needed to build a functional binding site for the NrdR repressor.

18
Retroelement Hypomethylation Links Hypoxia Signaling, Immune Phenotypes, and Survival in Clear Cell Renal Cell Carcinoma

Nnam, C. F.; Salas, L.; Mboya, E. A.; Li, Y.; Zhang, M.; Kolling, F.; Perrard, L.; Palys, T. J.; Pflugradt, E.; Pioli, P. A.; Ernstoff, M. S.; Seigne, J. D.; Pettus, J. R.; Ren, B.; Song, L.; Christensen, B. C.

2026-05-06 cancer biology 10.64898/2026.05.01.722263 medRxiv
Top 0.7%
0.2%
Show abstract

BackgroundRetrotransposable elements (RE) comprise approximately 45% of the human genome and are typically repressed by DNA methylation to preserve genomic integrity. In cancer, global DNA hypomethylation can lead to RE derepression, resulting in genomic instability and activation of innate immune pathways through viral mimicry. While individual RE classes have been examined in clear cell renal cell carcinoma (ccRCC), the integrated epigenetic landscape of multiple RE families and their clinical relevance remain incompletely characterized. MethodsWe performed a genome-wide prediction of DNA methylation across three major RE classes (Alu, LINE-1, and LTR elements) using a validated computational framework applied to Illumina methylation array data from two independent ccRCC tumor cohorts. Integrated unsupervised clustering of RE methylation profiles was used to define the epigenetic subtypes. Associations with clinicopathologic variables, tumor immune microenvironment composition (DNA Methylation-derived), hypoxia signaling, innate immune activation, and overall survival were evaluated. Prognostic relevance was assessed using multivariable Cox regression models adjusting for age, sex, AJCC stage or AUA risk group, and immune and angiogenic tumor microenvironment features. Key findings were then externally validated in CPTAC-ccRCC and independently replicated in an institutional Dartmouth Cancer Center (DCC) cohort with matched methylation and RNA-sequencing data. ResultsIntegrated clustering identified three reproducible RE methylation subtypes, Repressed, Transient, and Active. In the discovery cohort, the Active subtype showed significantly worse overall survival than the Repressed subtype, with a graded survival pattern across RE methylation states that persisted after multivariable adjustment. RE hypomethylation was associated with reduced EPAS1 (HIF2A) expression, increased immune infiltration, elevated PD-1 expression, and heightened cGAS-STING and interferon signaling, consistent with an immune-inflamed yet immunosuppressed tumor state. In the external CPTAC validation cohort, RE methylation subtypes recapitulated key molecular features and showed supportive survival trends. In the independent DCC replication cohort, an Active RE state was again associated with poorer survival, lower EPAS1 expression, increased PD-1 expression, greater CD8 T-cell and Treg infiltration, and elevated T-cell exhaustion signatures, supporting the reproducibility of the prognostic and immune-exhausted phenotype across cohorts. ConclusionsWe identified RE methylation subtypes with distinct molecular, immunologic, and prognostic features in ccRCC. External validation in CPTAC and independent replication in DCC support the robustness of this RE methylation framework across large-scale and institutional cohorts. These findings highlight the prognostic potential of RE methylation profiles and support their integration into molecular classification strategies to improve risk stratification in ccRCC.

19
Sequence alignment of the primate lineage reveals evolutionary divergence and conserved secondary structural motifs in noncoding RNAs

Beeram, A.; Perry, Z. R.; Pyle, A. M.

2026-05-19 bioinformatics 10.64898/2026.05.18.725462 medRxiv
Top 0.7%
0.2%
Show abstract

Long noncoding RNAs (lncRNAs) constitute most of the human transcriptome and perform essential roles in chromatin organization and transcriptional regulation. Because lncRNA genes are not constrained by protein-coding ability, they tend to exhibit more rapid evolutionary divergence. Their poor nucleotide sequence conservation among mammals often led to the assumption that lncRNAs lack conserved structures. However, emerging evidence indicates that many noncoding RNAs adopt secondary and tertiary folds critical for protein recruitment, chromatin binding, and regulation of gene expression. Nevertheless, there are few experimental secondary structures for lncRNAs, hindering mechanistic insight into lncRNA structure-function relationships. Even without available structural data, covariation, in which two nucleotides co-evolve, can provide evidence for conserved structures. This requires sequence alignments with sufficient divergence to detect covariation but enough similarity to maintain alignment quality. Here we report the development of a novel computational pipeline to mine 190 unannotated primate genomes to generate high-quality multiple sequence alignments of noncoding RNAs. This pipeline performs sequence searching, locus extraction, cross-species alignment, and downstream analyses, including assessment of covariation and primary sequence conservation. Ultimately, we demonstrate that because many noncoding elements, such as lncRNAs evolve at a more rapid rate than protein-coding genes, phylogenetic analyses constrained within a narrower evolutionary span can be used to identify conservation of primary sequence and secondary structure. By focusing our alignments on the primate lineage, our method overcomes the limitations of broad phylogenetic analyses, enabling high-resolution detection of subtle conservation patterns and conserved secondary structural motifs of long noncoding RNAs.

20
KSHV miR-K12-9 Induces Transformation of Immortalized and Primary Endothelial Cells

Gay, L. A.; Sirohi, V.; Baddoo, M. C.; Flemington, E.; Tibbetts, S.; Renne, R.

2026-05-21 cancer biology 10.64898/2026.05.18.726106 medRxiv
Top 0.7%
0.2%
Show abstract

Like most herpesviruses, KSHV encodes multiple microRNAs (miRNAs). Collectively, they comprise an important mechanism through which the virus maintains latency and persists in cells. At the same time, individual miRNAs can also play distinct, nonredundant roles. Past experiments with single miRNA knockout viruses showed that miR-K12-9, in particular, filled a unique niche. Endothelial cells latently infected with the miR-K12-9 knockout grew to be many times larger than WT-infected cells and proliferated at a significantly slower rate. Their ability to migrate was slowed as well. RNA-seq identified nearly 8,500 differentially expressed genes between miR-K12-9 knockout- and WT-infected cells. To further study miR-K12-9, we generated Telomerase-Immortalized Microvascular Endothelial (TIME) cells expressing either miR-K12-9 or a control miRNA from a lentivirus. Unexpectedly, after approximately one month in culture, unmistakable morphological changes began to occur in two of the three miR-K12-9-expressing cell lines. These smaller, more rounded cells proliferated rapidly and swiftly took over the two cultures. Given this result, we proceeded to characterize all the lentivirus-transduced cell lines in various assays focused on oncogenesis. When looking at colony formation in soft agar, only those two miR-K12-9-expressing cell lines produced colonies, indicating a loss of contact inhibition. NOD/SCID mice injected with the two cell miR-K12-9-expressing cell lines developed tumors while those receiving other cell lines did not. To confirm reproducibility of these results, we transduced both TIME and primary endothelial cells (HUVECs) with the miR-K12-9 and control lentiviruses. Once again, approximately half of the cell lines expressing miR-K12-9 showed hallmark phenotypes of transformation. We are currently characterizing the miR-K12-9 targetome in the transduced cell lines and mouse tumors using bulk and single-cell RNA-seq. This should yield insights into the underlying mechanism and required cofactors of miR-K12-9-induced transformation. To our knowledge, this is the first description of transformation of endothelial cells by a viral miRNA.